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Rr , Abstract 

S^ ' In this paper we discuss several heuristic strategies which allow one 

to solve the Whitehead's minimization problem much faster (on most in- 
puts) than the classical Whitehead algorithm. The mere fact that these 
strategies work in practice leads to several interesting mathematical con- 
jectures. In particular, we conjecture that the length of most non- minimal 
elements in a free group can be reduced by a Nielsen automorphism which 
can be identified by inspecting the structure of the corresponding White- 
head Graph. 

^ ! 1 Introduction to Whitehead method 

f^S | Let X — {xi, . . . , Xm} be a finite alphabet, X^ 1 = {x^ 1 | x £ X} be the set of 

SO ■ formal inverses of letters from X and X ±x = X U X~ x . A word w = y± . . .y n 

in the alphabet X is called reduced if ?/, ^ yi+\ for i = 1, . . . , n — 1 (here 
we assume that (x^ 1 )^ 1 — x). Applying reduction rules xx~ l — > e,x~ 1 x — > e 
(where e is the empty word), one can reduce each word w in the alphabet X ±x 
to a reduced word w. The word w is uniquely defined and does not depend on 
a particular sequence of reductions. Denote by F — F(X) the set of reduced 
words over X ±l . The set F forms a group with respect to the multiplication 
u ■ v — uv, which is called a free group with basis X. The cardinality |X| is 
called the rank of F(X). Sometimes we write F n instead of F to indicate that 
the rank of F is equal to n. 

A bijection <j) : F — > F is called an automorphism of F if <fi(uv) — (j)(u)(f>(v) 
for every u,v £ F. The set Aut(F) of all automorphisms of F forms a group with 
respect to composition of maps. Every automorphism <f> £ Aut(F) is completely 
determined by the images 4>{x) of elements x £ X. The following two subsets 
of Aut(F) play an important part in group theory and topology. 

An automorphism t £ Aut(F) is called a Nielsen automorphism if for some 
x E X t fixes all elements y £ X,y ^ x and maps x to one of the elements x^ 1 , 
y ±1 x, xy ±x . By iV(A) we denote the set of all Nielsen automorphisms of F. 

An automorphism t £ Aut(F) is called a Whitehead automorphism if cither 
£ permutes elements of X ±x or £ fixes a given element a £ X and maps 



each element x G X ±1 ,x ^ a ±x to one of the elements x, xa, a~ 1 x 7 or a~ 1 xa. 
Obviously, every Nielsen automorphism is also a Whitehead automorphism. By 
W(X) we denote the set of non-trivial Whitehead's automorphisms of the second 
type. 

Observe that 

\N(X)\ = 4n(n - 1), \W(X)\ = 2n4 (,l ~ 1) - 2n 

where n = \X\ is the rank of F. 

It is known 0] that every automorphism from Aut(F) is a product of finitely 
many Nielsen (hence Whitehead) automorphisms. 

The automorphic orbit Orb(w) of a word w G F is the set of all automorphic 
images of w in F: 

Orb(w) = {v e F \3ip e Aut(F) such that w v = v}. 

A word w G F is called minimal (or automorphically minimal) if |iu| < |u> v 
for any (/? G Airf(.F). By t0 m «, we denote a word of minimal length in Orb(w). 
Notice that w m i„ is not unique. 

Problem 1.1 (Minimization Problem (MP)) For a word u G F find an 
automorphism (p G Aut(F) such that uip — u min . 

In 1936 J. H. C. Whitehead proved the following result which gives a solution 
to the minimization problem |7j. 

Theorem 1.1 (Whitehead) Let u, v G F n (X) and v G Orb(u). If \u\ > \v\, 
then there exists t G W(X ) such that 

\u\ > \ut\. 

An automorphism <f> G Aut(F) is called a length-reducing automorphism for 
a given word u G F if \u<f>\ < \u\. The theorem above claims that the finite set 
W{X) contains a length-reducing automorphism for every non-minimal word 
u G F. This allows one to design a simple search algorithm for (MP). 

Let u G F. For each t G W(X) compute the length of the tuple ut until 
\u\ > \ut\, then put t\ — t,u\ — ut\. Otherwise stop and output u m in — u. The 
procedure above is called the Whitehead Length Reduction routine ( WLR) . Now 
Whitehead Reduction Algorithm (WRA) proceeds as follows. Repeat WLR on 
u, and then on u\, and so on, until on some step k WRL gives an output u m i n . 
Then ut\ . . . tk-i = u m i n , so (f> = ii . . . tu-\ is a required automorphism. 

Notice, that the iteration procedure WRA simulates the classical greedy 
descent method (t\ is a successfull direction from u, t 2 is a successfull direction 
from u\, and etc.). Thcorem ll.il guarantees that the greedy approach will always 
converge to the global minimum. 

Clearly, there could be at most \u\ repetitions of WLR on an input u G F 

\u\ > \utl\ > ... > \uti...tl\ = Umin, I < \u\. 



Hence the worst case complexity of the algorithm WRA is bounded from above 

by 

cA n \u\ 2 , 

where A n = 2n<^ n ~^ —In is the number of Whitehead automorphisms in W(X). 
Therefore, in the worst case scenario, the algorithm seems to be impractical for 
free groups with large ranks. One can try to improve on the number of steps 
which takes to find a length-reducing automorphism for a given non-minimal 
element from F. In this context the main question of interest is the complexity 
of the following 

Problem 1.2 (Length Reduction Problem) For a given non-minimal ele- 
ment «gF find a length-reducing automorphism. 

We refer to [5] for a general discussion on this problem. 

In the next section we give some empirical evidence that using smart strate- 
gies in selecting Whitehead automorphisms t s W(X) one can dramatically 
improve the average complexity of WRA in terms of the rank of a group. 

2 Heuristics for Length Reduction Problem 

2.1 Nielsen first 

The first heuristic comes from a very naive approach: replace W{X) by N(X) in 
the Whitehead length reduction routine WLR and denote the resulting routine 
by NLR. Since the size of N(X) is quadratic and the size of W(X) is exponential 
in the rank of F, the algorithm NRA may give a real speedup in computations. 
However, it is known (see 0]) that the Whitehead theorem above does not hold 
after replacement of W(X) by N(X). Therefore, the algorithm NRA will not 
give the correct answer at least on some inputs. But this is not the end of the 
story. Now the question is how often the length reduction routine NLR gives the 
correct answer? 

To get some insights, we perform a simple experiment. For free groups F$, F4 
and F5 we generate test sets of non-minimal elements of Whitehead Complexity 
1 (see definitions in j^j), described in Tabled For a detailed description of the 
data generation procedures we refer to [3j- 



Dataset 


Group 


Dataset Size 


Min. length 


Avg. length 


Max. length 


£3 


^3 


10143 


3 


558.2 


1306 


D 4 


F A 


10176 


4 


570.9 


1366 


D 5 


F 5 


10165 


5 


581.3 


1388 



Table 1: Statistics of sets of non-minimal elements. 

For each set D n we compute the fraction of elements from D n which have 
length-reducing Nielsen automorphisms. The results of the computations to- 
gether with the corresponding 95% confidence intervals are given in Table |21 



We can see that most of the words have been reduced by Nielsen automor- 
phisms. We would like to mention here, that it can be shown statistically that 
increasing the length of elements in the datasets does not significantly change 
the results of experiments. 



Dataset 


D 3 


D 4 


D 5 


Fraction 


0.998 


0.997 


0.998 


95% Conf. Interval 


[0.9970,0.9988] 


[0.9957,0.9979] 


[0.9970,0.9988] 



Table 2: Fraction of elements in the sets D3, D4 and D5 with length-reducing 
Nielsen automorphisms. 

Based on these experiments one can speculate that with very high probability 
Nielsen automorphisms reduce the length of a given non-minimal element in F. 
More precisely, we state the following 

Conjecture 2.1 Let U n be the set of all non-minimal elements in F of length 
n and NU n C U„ the subset of elements which have Nielsen length-reducing 
automorphisms. Then 

lim ~i7TT = L 

n^oo \U n \ 

Our first heuristic is based on this conjecture and simply suggests to try 
Nielsen automorphisms first in the routine WLR, i.e., in this case we assume 
that in the fixed listing of automorphisms of W(X) the automorphism from 
N(X) come first. We refer to this heuristic as to Nielsen First and denote 
the corresponding Length Reduction Routine and the Whitehead Reduction 
algorithm (with respect to this ordering of W(X)) by WLRjvf and WRAjvf- 

The expected value of the number of steps for the routine WLR^r^ to find 
a length-reducing automorphism on an input u 6 F of length n is equal to 

P n \N(X)\ + (1 - P n )(\W(X)\ - \N(X)\), 

where P n = NU n /U n . 

Given that Conjecture 12. H is true, we expect WRAtvf to perform much bet- 
ter on average. In the next section we describe experimental results supporting 
this strategy. 

2.2 Cluster analysis 

According to the heuristic NF one has to apply Nielsen automorphisms to a given 
input w in some fixed order, which is independent of the word w. Intuitively, 
we expect some automorphisms to be more likely to reduce the length of a given 
word than the others. It suggests that the conditional probabilities 

Prob(\wt\ < \w\ \w), te N(X) 



may not be equal for different non-minimal words w £ F, so the order in which 
Nielsen automorphisms are applied to an input w should depend on the word 
w itself. 

The question we would like to address next is whether it is possible to find 
a dependence between a non-minimal word w and its length-reducing Nielsen 
automorphisms. For this purpose we employ methods from Statistical Pattern 
Recognition. 

Briefly, Pattern Recognition aims to classify a variety of given objects into 
categories based on the existing statistical information. The objects are typically 
presented by collections of measurements or observations (called features) which 
are real numbers. In this event the tuple of features that corresponds to a given 
object is called a feature vector, it can be viewed as a point in the appropriate 
multidimensional vector space M. d . Most of the approaches in Statistical pattern 
recognition are based on statistical characterizations of features, assuming that 
objects are generated by a probabilistic system. The detailed description of 
Pattern Recognition methods is out of scope of this report. We refer interested 
readers to ^0121 101 f° r general introduction to the subject, and |3] for applications 
of pattern recognition methods in groups. 

Unsupervised learning or clustering methods of pattern recognition are used 
when no a priori information about the objects is available. In this case there 
are general algorithms to group the feature vectors of objects into some "nat- 
ural classes" (called clusters) relative to the specified similarity assumptions. 
Intuitively, the objects whose feature vectors belong to the same cluster are 
more similar to each other than the objects with the feature vectors in different 
clusters. 

The most simple and widely used clustering scheme is called K -means. It is 
an iterative method. Let D — {xi, . . . , xjv} be a set of given objects, represented 
by the corresponding feature vectors x £ M. d . if -means begins with a set of K 
randomly chosen cluster centers //?,... , H° K G M d . At iteration i each feature 
vector is assigned to the nearest cluster center (in some metric || || on K d ). This 
forms the cluster sets C\ , . . . , C % K , where 

q = {x|||x-/4||<||x- / 4J|, xe£, m = l,...,K}. 

Then each cluster center is redefined as the mean of the feature vectors 
assigned to the cluster: 



1 v^ 



,*+l _ _ 

\ct 



xec* 
Each iteration reduces the criterion function J 1 defined as 

Jl = J2H ll*-/4ll- 

fc=lxeC£ 

As this criterion function is bounded below by zero, the iterations must 



converge. This method works well when clusters are mutually exclusive and 
compact around their center means. 

Here we claim that if -means algorithm allows one to discover some natural 
classes of non-minimal words. We show below that analysis of the corresponding 
cluster structures sheds some light on the relation between non-minimal words 
and their length-reducing automorphisms. 

We define features of elements w € F(X) as follows. Recall that the Labelled 
Whitehead Graph WG(w) — (V, E) of an element w £ F(X) is a weighted non- 
oriented graph, where the set of vertices V is equal to the set X ±x , and for 
Xi,Xj £ X ±1 there is an edge (xi,Xj) £ E if the subword %{Xj (or XjX~ ) 
occurs in the word w viewed as a cyclic word. Every edge (xi , Xj ) is assigned a 
weight lij which is the number of times the subwords XiX~ and XjX~ occur in 
w. 

Let l(w) be a vector of edge weights in the Whitehead Graph WG(w) with 
respect to a fixed order. We define a feature vector f(w) by 

/H = ]^['H- 

To execute the K-means algorithm one has to define in advance the expected 
number of clusters K. Since we would like these clusters to be related to the 
set of Nielsen automorphisms N(X) we put K = \N(X)\. 

To evaluate usefulness of the clustering we use the goodness measure R m ax 

defined below. Let C C D be a cluster of the data set D C F n — F(X). For 

t e N(X) define 

\{w e C \ \wt\ < \w\} 
R(t,C) = jgj ■ 

The number R(t, C) shows how many elements in C are reducible by t. Now put 

R max (C) - max{R{t,C) | t £ N(X)} 

and denote by tc a Nielsen automorphism t £ N(X) such that R(tc,C) — 
Rmax(C). The number R m ax{C) shows how many elements in C can be reduced 
by a single automorphism, in this case by tc- We also define the average value 
of the goodness measure 



1 K 

avg(R max ) — — £ ^RmaxjCj), 



where K is the number of clusters. 

The results of ff-mean cluster analysis of sets of randomly generated non- 
minimal elements in free groups F3, F4, F§ are given in Table |3| It shows that 
more that 70% of elements in every cluster can be reduced by the same Nielsen 
automorphism. In the free group -F3, where the number of clusters is significantly 
smaller, the corresponding percentage is over 98%. Moreover, our experiments 
show that td ^ tc for i =/= j. In other words there are no two distinct clusters 



Free group 


^3 


F 4 


^5 


number of clusters, K 


24 


48 


80 


avg(R max ), if-mcans 


0.985 


0.879 


0.731 



Table 3: Average values of the goodness measure R m ax for if-means clustering. 



such that one and the same Nielsen automorphism reduces most of the elements 
in both clusters. 

The discovered cluster structure gives rise to the following strategy in solving 
the Length Reduction Problem for a given word w. Let /xi, . . . , jjlk be the cen- 
ters of clusters C\ , . . . , Ck computed by the if-means procedure. We compute 
the distance ||/(w) — /z»|| for each % = 1, . . . , K. Now we list the Nielsen auto- 
morphisms in N(X) in the order tj 1 , ti 2 , . . . , U K with respect to the distances 



II/(«0-M<||/(uO-/k«|| 



< 



<\\f(w)-UH K \\. 



To find a length reducing automorphism for a given word w we subse- 
quently apply automorphisms from N(X) in the prescribed order until we find 
an automorphism £, G -^(X) which reduces the length of w. If such an auto- 
morphism does not exist we proceed with the remaining automorphisms from 
W{X) - N(X) as in the NF heuristic. 

From the description of the if-means method we know that clusters are 
characterized by the center means of the feature vectors of elements in the 
same cluster. The observations above lead us to the following vaguely stated 
conjecture, which gives a model to describe behavior of non-minimal elements 
from F in terms of their feature vectors. 

Conjecture 2.2 The feature vectors of weights of the Whitehead Graphs of el- 
ements from F are separated into bounded regions in the corresponding space. 
Each such region can be bounded by a hypersurface and corresponds to a par- 
ticular Nielsen automorphism in a sense that all elements in the corresponding 
class can be reduced by that automorphism. 



2.3 Improvement on the clustering 

Experiments with X-means clustering algorithm show that clustering is a useful 
tool in solving the length reduction problem. Now, the goal is to make clustering 
more effective. The further analysis of the clusters suggests that to some extent 
they correspond to partitions of elements in F which can be reduced by one 
and only one Nielsen automorphism. To verify this conjecture we perform the 
following experiment. 

Let S C F n = F(X) be a set of randomly generated non-minimal elements 
and D the set used for cluster analysis in the previous section. Note that S is 
generated independently from the set D. For each automorphism t G N(X) put 



O t = {w G S | Vr G N(X)(\wr\ < 



= *)} 



and define new cluster centers by 



1 



E 



fH 



(i) 



as the mean feature vector of the elements from S that can be reduced only by 
t and no other automorphisms. 

We cluster elements from D based on the distance between the corresponding 
feature vector and centers A*: 

C t = {w e D | Vr G N(X)(\\f(w) - X t \\ < \\f(w) - \ r \\)}. 

The results of evaluation of the clusters C t are given in Table 0J One can see 
that the goodness measure is improved and is close to 1 in every case. 



Free group 


F 3 


F 4 


^5 


number of clusters, K 


24 


48 


80 


avg{R max ), distance to At 


0.998 


0.993 


0.991 



Table 4: Average values of the goodness measure R m ax for the clustering based 
on the distance to the estimated centers A*. 



Similar to the strategy based on the centers of the if-means clusters, we 
define a new search procedure WRAc which employs a heuristic based on the 
distances to centers A t . Let w be a word and < A tl , . . . , \ tK > be the centers 
corresponding to each of the Nielsen automorphisms ti £ N(X). Put d(i) = 
\\f(w) — X ti || and construct a vector 



< d(mi), d(m 2 ), ■ ■ ■ , d(m K ) >, 



where 



d(mi) < d(mz) 



< 



< d{m K ). 



To find a length reducing automorphism for a given word w, the algorithm 
WRAc applies Whitehead automorphisms to w in the following order. First, 
Nielsen automorphisms t mi , . . . , t mK are applied subsequently. If none of the 
Nielsen automorphisms reduces the length of w the algorithm WRAc proceed 
with the remaining automorphisms W(X) — N(X) in some fixed order. 

Based on the results of the cluster analysis from Table 01 we expect the 
algorithm WRA^ to reduce a non-minimal word w using very few elementary 
automorphisms on average. 



2.4 Maximal weight edges 

Now we would like to take a closer look at the edges' weight distributions at 
the cluster centers. First, observe that every edge in the Whitehead graph WG, 
except for the ones which correspond to subwords of type x 2 , x E X , will 



correspond to subwords reducible by two particular Nielsen transformations. 
For example, edge connecting nodes a and b corresponds to subwords (ab^ 1 ) 
both of which arc reduced by automorphisms 



(o- 

(a 



ab, b 
a.b - 



ba). 



In fact there is no other Nielsen transformation that will reduce the length of 
words (a6 _1 ) ±1 . 

To generalize, let WG(w) be a Whitehead graph of a word w with the vertex 
set V and the set of edges E. Let e = (x, y^ 1 ), x, y^ 1 £ V, be an edge in E. By 
construction e corresponds to subwords s e — (xy) of the word w. The only 
Nielsen automorphisms which reduce length of the subwords s e are 



and 



V>* -X 



i>l -y 



xy 



V, z 



z, Vz ^ x, z £ X 



Z, Vz ^ y, z e X. 



We will call automorphisms ip^i^e the length reducing Nielsen automorphisms 
with respect to the edge e = (x, y^ 1 ) and denote tp e = {ip*, ^f }■ 

The following phenomenon has been observed for all clusters in free groups 
i*3, F4, and F$. Let C t be a cluster of a test set D n , n = 3,4,5, then for all 
t(=N(X), 

where e max is the edge having the maximal weight in the cluster center X t . 
It suggests that at least in the case of free groups F 3 , F 4 , F 5 one can try to 
estimate a length-reducing automorphism for given word w by taking the length- 
reducing Nielsen automorphisms of the highest weight edge in the Whitehead 
graph WG(w). 

To evaluate the goodness of the heuristic based on the maximal edge weight 
in the Whitehead graph we compute the fraction of elements in the sets D3, D4 
and D5 , reducible by the Nielsen automorphisms corresponding to the maximal 
weight edge. The corresponding goodness measure, evaluated on a set D, is 
given by 



Qmax 



1 

Jd 



\{w e D I 3t £ 1pe mcLX (w), s.t. |tyt| < \w\ 



Dataset 


D 3 


D 4 


D 5 


Qmax 


0.991 


0.986 


0.986 



Table 5: Values of the goodness measure Qmax for sets of non-minimal elements 
in free groups F3, F4 and F$. 



Values of the goodness measure Gmax for test sets in free groups F 3 , F4 
and -F5 are given in Table |S] It shows, that heuristic is surprisingly effective. 
Nevertheless, centroid based method still yields better results. Note that Qmax 
measures success of applying two automorphisms corresponding to the maximal 
weight edge, where the centroid based method was evaluated by the success rate 
of only one automorphism which corresponds to the closest center. 

The observation provides a new search procedure which we denote by WRAmax ■ 
Let wbea word and WG(w) = (V, E) be the corresponding Whitehead graph. 
Denote by E' the set of edges which do not correspond to the subwords of type 
x ±2 , x£X 

E' = {eeE\ e^ (v, t^ 1 ), V £ V}. 

It has been shown above, that for each edge e from E' there exists two unique 
length reducing automorphisms. Note that 2\E'\ = \N(X)\, where N(X) is the 
set of Nielsen automorphisms for free group F(X). 

We can order Nielsen automorphisms tp ei C N(X): 

< i>e 1 ,lpe 2 ,---,' l Pe lE , l > (2) 

such that edges ei, . . . , e\E'\ are chosen according to the decreasing order of the 
values of the corresponding weights 

W ei > UJe 2 > ••• > w e|B/| . 

Note that ip e is not a single automorphism, but a pair of Nielsen length reducing 
automorphisms with respect to the edge e. Here we do not give any preference 
in ordering automorphisms in i/j e . 

To find a length-reducing automorphism for w procedure WRAmax first 
applies Nielsen automorphisms in the order given by (0) . If none of the Nielsen 
automorphisms reduces the length of w, WRAmax proceeds with the remaining 
automorphisms from W(X) — N(X). 

3 Comparison of the strategies 

In this section we describe experiments designed to compare the performance of 
WRA implemented with different search strategies. We compare four variations 
of the algorithm. WRA^ is the variation of WRA, where a random order of the 
elements from W n is used when searching for a length reducing automorphisms. 
WRAwf and WRAc correspond to the implementations with Nielsen First 
and Centroid based heuristics respectively. The algorithm WRAAf ax employs 
strategy which applies automorphisms corresponding to the largest edge weights 
of the Whitehead Graph. The algorithms were compared on randomly generated 
sets of primitive elements S3, S4, S 5 in free groups F 3 , F 4 , and F 5 , respectively. 
Some descriptive statistics of the test sets S n are given in Tabic El 

Let A be one of the variations WRA fl , WRAjvf, WRA C , and WRAmax of 
the Whitehead Reduction Algorithm. By an elementary step of the algorithm A 
we mean one application of a Whitehead automorphism to a given word. Below 



10 



Dataset 


Group 


Dataset Size 


Min. length 


Avg. length 


Max. length 


s 3 


^3 


5645 


3 


1422.1 


143020 


S4 


F 4 


5241 


4 


2513.1 


168353 


s 5 


F 5 


3821 


5 


2430.5 


160794 



Table 6: Statistics of the test sets of primitive elements. 



we evaluate the performance of A with respect to the number of elementary 
steps that are required by A to execute a particular routine. 

Let N total = N to tai(A, S n ) be the average of the total number of elementary 
steps required by A to reduce a given primitive element w G S n to a generator. 

By N re d = N re d(A, S n ) we denote the average number of elementary length- 
reducing steps required by A to reduce a given primitive element w G S„ to a 
generator, so N re d is the average number of "productive" steps performed by 
A. It follows that ii ti, . . . ,ti are all the length reducing automorphisms found 
by A when executing its routine on an input w G S n then \wt\ ■ . -U\ = 1 and 
the average value of I is equal to N re d- 

Finally, denote by Nlrp — Nlrp(A, S n ) the average number of elementary 
steps required by A to find a length-reducing automorphism for a given non- 
minimal input w. 

In Tabled we present results of our experiments on performance of the algo- 
rithms WRA fl , WRAjv F , WRA C and WRA M ax on the test sets S n , n = 3, 4, 5. 
The algorithms compare as expected. The algorithms WRAc and WRA^ax 
perform very efficiently with the numbers Nlrp and N re d being small. Algo- 
rithm WRAc based on the centroid approach shows best over all performance. 
The growth of the numbers Nlrp with the rank could be explained by occa- 
sional occurrence of non-minimal words that cannot be reduced by Nielsen au- 
tomorphisms. In this event the algorithm tries Whitehead automorphisms from 
W(X) — N(X) the number of which growth exponentially with the rank. Notice 
that in every our experiment the number N re d of length reductions performed 
by WRAc is less than the corresponding number in the other approaches. 

In Table |H1 we give the correlation coefficients showing dependence of the 
number of elementary steps required by a particular algorithm to find a length- 
reducing automorphism with respect to the length of the input words. The 
coefficients are negative in all cases which indicates that the values of Nlrp do 
not increase when the words' length increases. 

4 Conclusions 

The experimental results presented in this paper show that using appropriate 
heuristics in the algorithm WRA, one can significantly reduce the complexity 
of the Whitehead minimization problem on most inputs with respect to the 
group rank. Suggested heuristics reduce the average number of Whitehead 
automorphisms required to find a length-reducing automorphism for a given 
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Strategy 


Ntotal 


N re d 


N L rp 


WRA C 


19.9 


18.4 


1.1 


WRAmax 


47.1 


23.9 


1.9 


WRAjvf 


207.8 


28.2 


7.37 


WRA fl 


374.8 


29.8 


12.6 


a) ^3; 


Strategy 


Ntotal 


N re d 


N L rp 


WRAc 


58.8 


34.1 


1.4 


WRAmax 


152.8 


42.5 


3.0 


WRAjvF 


1052.6 


56.2 


18.7 


WRAf 


2610.4 


58.8 


44.4 


b)^ 4 ; 


Strategy 


Ntotal 


^ red 


N L rp 


WRA C 


162.0 


50.9 


2.4 


WRAmax 


342.2 


58.8 


4.5 


WRAatf 


2307.6 


75.4 


30.6 


WRA fl 


15939.6 


78.8 


202.0 



c)F 5 . 

Table 7: Results of experiments with sets of primitive elements in free groups 
F 3 ,F4 and F 5 . Counts are averaged over all inputs. 



word. The performance of heuristic algorithms tested on the sets of randomly 
generated primitive elements shows robust behavior and does not deteriorate 
when the length of the input words increases. 

One of the interesting contributions of this paper is the empirically discov- 
ered properties of non-minimal elements of free groups formulated in Conjec- 
tures |0] and [^1 These conjectures suggest that the length of a "generic" 
non-minimal elements in a free group can be reduced by a Nielsen automor- 
phism. Moreover, the feature vectors of the weights of the Whitehead's Graphs 
of non-minimal elements are divided into " compact" regions in the correspond- 
ing vector space. Each such region is related to a particular Nielsen automor- 
phism, that reduces the length of all elements in the region. We believe this 
is one of those few cases when a meaningful rigorous, but not intuitively clear 
conjecture, in group theory was obtained by using experimental simulations and 
statistical analysis of the problem. 

It remains to be seen why the algorithm WRAc is able to find minimal 
elements using a smaller number of length reductions on average. We are going 
to address this issue in the subsequent paper. 
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Strategy 


^3 


F 4 


^5 


WRA C 


-0.008 


-0.001 


-0.006 


WRAmax 


-0.024 


-0.023 


-0.038 


WRAatf 


-0.009 


-0.022 


-0.022 


WRA fl 


-0.038 


-0.014 


-0.035 



Table 8: Correlation coefficients between words length and values of Nlrp- Neg- 
ative coefficients indicate that Nlrp does not increase when length increases. 
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